Coherence and Replacement Protocol of DICE - A Bus-Based COMA Multiprocessor
Abstract
As microprocessors become faster and demand more bandwidth, the already limited scalability of a shared bus decreases even further. DICE, a shared-bus multiprocessor, uses cache-only memory architecture (COMA) to narrow the speed gap between modern high-performance microprocessors and the bus. DICE aims to optimize COMA for a shared-bus medium, in particular to reduce the detrimental effects of cache coherence and of the "last memory block" problem on replacement. In this paper, we present the coherence and replacement protocol of the DICE multiprocessor and its design trade-offs. We describe a four-state write-invalidate coherence protocol in detail. Replacement poses an overhead problem unique to COMA: a victim block with ownership must be relocated to a remote node so that the last cached copy of a memory block is not discarded. We show that the relocation process can be implemented efficiently using a temporary storage called a relocation buffer and a priority-based selection algorithm. We present performance results that show a drastic reduction in global bus traffic compared to a traditional shared-bus multiprocessor architecture.
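The replacement rule in the abstract can be illustrated with a minimal sketch. The abstract does not name the four states or give the priority order, so the state names, the `replace` and `pick_target` helpers, and the ordering (an invalid remote slot first, then a non-owner shared copy, never an owned block) are all assumptions made for illustration, not the protocol as published.

```python
from enum import IntEnum
from typing import Dict, Optional, Tuple


class SlotState(IntEnum):
    # Hypothetical state names: the abstract only says there are four states.
    INVALID = 0          # slot holds no valid block
    SHARED = 1           # copy without ownership
    OWNER_SHARED = 2     # owner; other copies may exist
    OWNER_EXCLUSIVE = 3  # owner; possibly the last cached copy


OWNED = {SlotState.OWNER_SHARED, SlotState.OWNER_EXCLUSIVE}


def pick_target(candidates: Dict[int, SlotState]) -> Optional[int]:
    """Choose a remote node to receive a relocated block.

    `candidates` maps a remote node id to the state of the slot that node
    would give up. Assumed priority: lowest state value first (INVALID,
    then SHARED), ties broken by node id. Owned slots are never displaced.
    """
    acceptable = {n: s for n, s in candidates.items() if s not in OWNED}
    if not acceptable:
        return None
    return min(acceptable, key=lambda n: (acceptable[n], n))


def replace(victim: SlotState,
            candidates: Dict[int, SlotState]) -> Tuple[str, Optional[int]]:
    """Decide what happens to a victim block chosen for replacement."""
    if victim not in OWNED:
        # A non-owner copy can be dropped: the owner still holds the block.
        return ("discard", None)
    # An owned victim must survive; in this sketch it waits (conceptually in
    # the relocation buffer) until some remote node can accept it.
    target = pick_target(candidates)
    if target is None:
        return ("wait", None)
    return ("relocate", target)
```

For example, under these assumptions `replace(SlotState.OWNER_EXCLUSIVE, {1: SlotState.SHARED, 2: SlotState.INVALID})` selects node 2, because an invalid slot can be overwritten without displacing any data.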
Related papers
Global Bus Design of a Bus-Based COMA Multiprocessor DICE
DICE is a shared-bus multiprocessor based on a distributed shared-memory architecture, known as Cache-Only Memory Architecture (COMA). Unlike previous COMA proposals for large-scale multiprocessing, DICE utilizes the COMA to effectively decrease the gap between modern high-performance microprocessors and the bus. As microprocessors become faster and demand more bandwidth, the already limited sc...
Pii: S0141-9331(98)00097-0
DICE is a shared-bus multiprocessor based on a distributed shared-memory architecture, known as cache-only memory architecture (COMA). Unlike previous COMA proposals for large-scale multiprocessing, DICE utilizes COMA to effectively decrease the speed gap between modern high-performance microprocessors and the bus. DICE tries to optimize COMA for a shared-bus medium, in particular to reduce detr...
Protocol Mapping for a Bus-Based COMA Multiprocessor
As microprocessors become faster and demand more bandwidth, the already limited scalability of a shared bus decreases even further. Cache-only memory architecture (COMA) for distributed shared-memory multiprocessors has the potential to effectively decrease the gap between high-performance microprocessors and the bus. The concept of COMA is to organize traditional main memory as a large cache. COMA all...
On timing constraints of snooping in a bus-based COMA multiprocessor
Cache-only memory architecture has the potential to decrease global bus traffic in shared-bus multiprocessors, thereby reducing the speed gap between modern microprocessors and global backplane bus systems. However, the (huge) size of attraction memory (AM) in each processor node makes it difficult to properly match the access time of its state and tag storage to the bus cycle. This becomes a se...
Removing Timing Constraints of Snooping in a Bus-Based COMA Multiprocessor
Cache-Only Memory Architecture has the potential to decrease global bus traffic in shared-bus multiprocessors, reducing the speed gap between modern microprocessors and global backplane bus systems. However, the (huge) size of Attraction Memory (AM) in each processor node makes it difficult to properly match the access time of its state and tag storage to the bus cycle. This becomes a serious burden ...
Journal:
- J. Parallel Distrib. Comput.
Volume: 57
Publication date: 1999